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Although there have been many investigations into how trinucleotide repeats affect nucleosome formation and 
local chromatin structure, the nucleosome positioning of GAA triplet-repeats in the human genome has remained 
elusive. In this work, the nucleosome occupancy around GAA triplet-repeats across the human genome was 
computed statistically. The results showed a nucleosome-depleted region in the vicinity of GAA triplet-repeats 
in activated and resting CD4* T cells. Furthermore, the A-tract was frequently adjacent to the upstream region 
of GAA triplet-repeats and could enhance the depletion surrounding GAA triplet-repeats. In vitro chromatin 
reconstitution assays with GAA-containing plasmids also demonstrated that the inserted GAA triplet-repeats 
destabilized the ability of recombinant plasmids to assemble nucleosomes. Our results suggested that GAA 
triplet-repeats have lower affinity to histones and can change local nucleosome positioning. These findings 
may be helpful for understanding the mechanism of Friedreich's ataxia, which is associated with GAA triplet- 
repeats at the chromatin level. 
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1. Introduction 


Nucleosome formation on repeated DNA fragments has been inves- 
tigated for many years [1,2]. The previous works showed that CAG 
trinucleotide repeats associated with diseases such as myotonic dystro- 
phy type 1 and spinocerebellar ataxia type 1 are strong nucleosome 
positioning signals [2-5]. The CGG triplet-repeats associated with fragile 
X syndrome have low affinity to histone [2,6-9]. Both the pure and 
interrupted ATTCT pentanucleotide repeats associated with spinoce- 
rebellar ataxia type 10 have strong affinities to histones in vitro [1]. 

Friedreich's ataxia (FRDA), an autosomal recessive neurodegenera- 
tive disease, is the most common hereditary ataxia. FRDA is caused by 
the expansion of GAA triplet-repeats in the first intron of the frataxin 
(FXN) gene (OMIM 606829) [10,11]. Although early biochemical studies 
have documented that GAA triplet-repeats adopt a non-B-DNA struc- 
ture, such as triplexes and ‘sticky’ DNA, no experimental evidence has 
demonstrated the presence of non-B-DNA structure in the chromosomal 
FXN in cells isolated from FRDA patients [12-16]. Recently, increasing 
evidence has shown that GAA triplet-repeat-induced chromatin changes 
may be responsible for the FXN mRNA deficits. Saveliev et al. found that 
expanded GAA triplet-repeats served as a source of HP-1-mediated het- 
erochromatin to silence a nearby reporter gene [17]. Subsequently, 


Abbreviations: FXN, frataxin; FRDA, Friedreich's ataxia; EM, electron microscopy; AUC, 
analytical ultracentrifugation. 
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several studies have suggested that the expansion of the FRDA GAA 
triplet-repeats can cause epigenetic changes, such as H3-K9 hyper- 
methylation and reductions in histone acetylation. The GAA triplet- 
repeats can also recruit heterochromatin-binding proteins. Further, the 
formation of heterochromatin on intron 1 of the FXN gene may trigger 
FXN gene silencing through an intrinsic blocking of transcription initia- 
tion or elongation [11,18-27]. Ruan et al. found that the efficiency of 
nucleosome assembly on pure GAA, repeat duplex was only half that 
of the pUC control DNA. Compared with duplex DNA in vitro, the GAA/ 
GAA/TTC triplex further reduced nucleosome assembly efficiency [10]. 

Previous works have focused primarily on the chromatin structure of 
disease-length GAA triplet-repeats, and less information is available on 
how GAA triplet-repeats affect the local nucleosome positioning in 
healthy individuals. However, it is important to consider the behavior 
of both healthy and disease length repeat tracts at the nucleosome level. 

Analyzing the nucleosome distribution around GAA triplet-repeats 
across an entire organism's genome is more feasible with the produc- 
tion of high-resolution maps of in vivo nucleosome positions that were 
generated by high-throughput sequencing techniques. In this study, 
the distribution of the nucleosome occupancy around GAA triplet- 
repeats throughout the human genome was analyzed based on pub- 
lished high-throughput experimental data [28]. Moreover, differences 
in chromatin structure among DNA fragments containing GAA triplet- 
repeats, CAG triplet-repeats, and Widom 601 sequences were investi- 
gated using chromatin reconstitution assay. The results suggested that 
GAA triplet-repeats have lower affinity to histone octamers and could 
change local nucleosome positioning. 
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2. Results 
2.1. GAA triplet-repeat distribution in the human genome 


To determine the frequencies of GAA triplet-repeats in the human 
genome, all of the uninterrupted GAA triplet-repeat tracts consisting 
of five or more repeated units were identified in the human genome 
(Fig. 1). As expected, the frequency of the GAA triplet-repeats was 
different for each chromosome. For example, the frequency of GAA 
triplet-repeats in chromosome 1 was the highest, and that in chro- 
mosome Y was the lowest. In contrast, the frequency of GAA triplet- 
repeat tracts decreased roughly exponentially with repeat length. The 
most frequent GAA triplet-repeat copies were 5, 6, 7, and 8, with more 
than one hundred frequencies. The frequency of the GAA triplet- 
repeats of 9 to 25 copies ranged from 47 to 13. In particular, only 6 
fragments were found that contain more than 32 GAA triplet-repeats 
in the genome. 


2.2. Nucleosome occupancy around GAA triplet-repeat regions 


To investigate whether GAA triplet-repeats affected nucleosome 
arrangement in vivo, we first analyzed the nucleosome occupancy in 
the vicinity of repeat sites in the human genome. As shown in Fig. 2A, 
the center of the repeats showed strong depletion of nucleosomes in 
activated CD4* T cells, which are flanked by a small nucleosome valley. 
The nucleosome occupancy on approximately 200 bp downstream of 
GAA triplet-repeats approached the nucleosome level on sequences 
selected randomly from chromosomes 1-5 (Supplemental Fig. S1). 
This result suggested that GAA triplet-repeats affected local nucleosome 
positioning in vivo. 

For comparison, the nucleosome occupancy across other repeat 
tracts was also analyzed. Long, pure CAG triplet-repeats are stronger 
nucleosome positioning elements in vitro [4,8]. However, no nucleo- 
some positioning peaks were detected at the short CAG triplet-repeats 
in vivo (Fig. 2B), although the short CAG triplet-repeats were also not 
shown to alter nucleosome occupancy in vitro. Similar to GAA triplet- 
repeats, the centers of the TAA and CAA triplet-repeats also showed 
strong nucleosome depletion (Supplemental Figs. S2 and S3). 


2.3. Comparison of nucleosome occupancy surrounding GAA triplet-repeats 
in different T cells 


A recent study suggested that nucleosome location varies in differ- 
ent cell environments, and that several nucleosomes are either removed 
or shifted thus making the DNA sequence more accessible to regulatory 
factors [28]. To determine whether the nucleosome structure on GAA 
triplet-repeats was dynamically regulated in different cell environ- 
ments, we compared the nucleosome distributions of the repeat regions 
in both activated and resting CD4* T cells. As shown in Fig. 3, no obvious 
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differences in nucleosome profiles near GAA triplet-repeats were 
observed between activated and resting CD4* T cells. However, a signif- 
icant increasing tendency (paired-sample t test, P < 1.7e-59) in nucleo- 
some occupancy, corresponding to all of the GAA triplet-repeat tracts, 
was found in the resting cells compared with the activated cells. 


2.4. A-tracts at the 5’ end of GAA triplet-repeats enhance the local 
nucleosome depletion in vivo 


Surprisingly, when we analyzed the distribution of GAA triplet- 
repeats, A-tracts were frequently adjacent to GAA triplet-repeats in 
the human genome. We then tested the A contents in 10-bp upstream 
regions of GAA triplet-repeats, using other triplet-repeats as controls, 
such as TAA, CAC, CAG, CTG, and TTG. As shown in Fig. 4A, for all of 
5-8 trinucleotide repeats, the percentages of GAA triplet-repeats with 
7-9 and 10 adenine nucleotides in 10-bp regions in their 5’ ends were 
24.2% and 33.7%, respectively. The corresponding percentages were 
19% and 3.3% for TAA triplet-repeats. Almost 5% of CAC or CAG triplet- 
repeats exhibited more than 6 adenine nucleotides in the upstream 
region compared with only slightly more than 0.2% of the CTG or TTG 
triplet-repeats. 

To test the contribution of A-tracts at the 5’ end of the repeats to the 
GAA triplet-repeats inducing nucleosome depletion, GAA-containing 
sequences were classified into two groups according to the presence 
of 0-6 or 7-10 adenine nucleotides in the 10-bp upstream regions of 
repeats. The nucleosome occupancy mapping around GAA5 and GAA6 
triplet-repeats are each plotted for activated and resting T cells in 
Fig. 4B and Supplemental Fig. S4. Consistent with the findings men- 
tioned above, pronounced nucleosome depletion regions were found 
in the vicinity of all of the triplet-repeats. A significant difference 
(paired-sample t test, P < 8e-38) was found in average nucleosome 
occupancy among the different A contents upstream of the triplet- 
repeats. Compared with valleys containing a lower A content with 0-6 
adenosines, the deeper nucleosome level valleys surrounding the 
repeats were observed on the sequences containing 7-10 adenosines. 
In addition, we investigated the contribution of downstream GAA 
triplet-repeats to nucleosome depletion around the A-tract. The A-tract 
in the genome can be divided into two categories: A-tracts with and 
without downstream GAA triplet-repeats. All of the sequences were 
aligned with the A-tract start position. Supplemental Fig. S5 showed 
that the A-tract excluded the local nucleosome and that nucleosome 
depletion was positively correlated with A-tract length. The local nucle- 
osome level on the A-tract with downstream GAA triplet-repeats was 
obviously lower than that on the A-tract without downstream GAA 
triplet-repeats. These results suggested that the GAA triplet-repeats 
could induce local nucleosome depletion in vivo and that the A-tract 
upstream of repeats could further enhance the depletion surrounding 
GAA triplet-repeats. 
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Fig. 1. Distributions of GAA trinucleotide repeats in the human genome. The y-axis denotes repeat copy numbers (repeats number > 5), and the x-axis denotes chromosome length. The 


chromosome length bar is 1E8 bp. 
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Fig. 2. Nucleosome occupancy level in a 2000-bp region relative to the start site of (A) GAA triplet-repeats and (B) CAG triplet-repeats. The different colors and styles of the curves 
represent the distribution of average nucleosome occupancy around the repeated tract from 5 to 8 copies. GAAS (N = 1344), GAA6 (N = 550), GAA7 (N = 191), GAA8 (N = 108), 
CAG5 (N = 578), CAG6 (N = 238), CAG7 (N = 120), and CAG8 (N = 58) were evaluated, and N denotes the number of the statistical samples. 


We also analyzed the nucleosome occupancy patterns on TAA and 
CAA triplet-repeats with a similar technique. Compared with valleys 
containing 0-6 adenosines, the deeper nucleosome valleys surrounding 
CAA triplet-repeats were observed on the sequences containing 7-10 
adenosines (Supplemental Fig. S6). However, no obvious difference 
was found in average nucleosome occupancy between the TAA triplet- 
repeats with different upstream A contents (Supplemental Fig. S7). 
Furthermore, while the CAA and TAA triplet-repeats were adjacent to 
the downstream region of A-tracts, the local nucleosome depletion 
surrounding A-tracts was also obviously enhanced (Supplemental 
Figs. S8 and S9). 

The effects of several triplet-repeat sequences and A-tracts on the 
distribution of nucleosome occupancy were analyzed statistically. 
Here, we preferred to pay greater attention to special regions of the 
genome. The first example was the region of the FXN gene in healthy 
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individuals. A poly(A)i6 segment was adjacent to the 5’ end of GAA6 
triplet-repeats in the first intron of the FXN gene. The nucleosome 
profiles in the region from exon 1 to part of intron 1 of the FXN gene 
for activated (top panel) and resting (middle panel) T cells are shown 
in Supplemental Fig. S10. Total nucleosome depletion was found from 
the A-tract to approximately 100 bp downstream of repeats in activated 
T cells. Compared with the weak nucleosome occupancy at GAA triplet- 
repeats, remarkable nucleosome depletion from approximately 60 bp 
upstream of the A-tract to the triplet-repeat start site was observed 
in resting T cells. These data indicated that nucleosome depletion on 
GAA triplet-repeats in the FXN genes of healthy individuals was consis- 
tent with the statistical results. 

We further analyzed nucleosome occupancy on the six fragments 
containing more than 32 GAA triplet-repeats and their flanking se- 
quences in the human genome. Rare A-tracts neighboring the 5’ end 
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Fig. 3. Nucleosome occupancy level in a 2000-bp region relative to the start site of (A) GAAS triplet-repeats, (B) GAAG triplet-repeats, (C) GAA7 triplet-repeats, and (D) GAA8 


triplet-repeats in activated or resting CD4* T cells. 
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Fig. 4. Adenosines contents at the 5’ end of GAA triplet-repeats and their effect on the local nucleosome structure in vivo. A. A contents in a 10-bp upstream region of GAA, TAA, CAC, CAG, 
CTG, and TTG triplet-repeats. Different color plots in each bar denote the sequence percentages with 0-3, 4-6, 7-9, or 10 A in 10-bp upstream triplet-repeats, respectively. B. Nucleosome 
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(N = 594) and GAA5-A(7-10) (N = 750) were evaluated, and N denotes the number of the statistical samples. 


of GAA triplet-repeats were found (Supplemental Table S1). We com- 
puted the nucleosome occupancy over these six selected fragments in 
activated and resting T cells. As shown in Supplemental Fig. S11, total 
nucleosome depletion over GAA32, GAA33, GAA35, GAA43, and 
GAAS6 triplet-repeats and partial nucleosome depletion on GAA5O 
triplet-repeats were found in activated T cells. No obvious difference 
was found in nucleosome distribution between activated cells and 
resting cells for GAAn triplet-repeats (n = 33, 35, 43, 50, and 56) 
(Supplemental Fig. $12). 


2.5. GAA triplet-repeats change the local chromatin structure in vitro 


To examine whether GAA triplet-repeats affected the local chroma- 
tin structure in vitro, we constructed recombinant plasmids containing 
GAA7, GAA27, GAA42, CAG7, CAG27, and CAG44 triplet-repeats and 
the Widom 601 sequence in multiple cloning sites of pUC19 (Fig. 5A). 
The GAA and CAG triplet-repeat lengths used were healthy-length 
repeat tracts. Using a recombinant histone octamer that was expressed 
and purified from bacteria and that lacked all posttranslational modifi- 
cations (Fig. 5B), chromatins were assembled on plasmids-containing 
GAA and CAG triplet-repeats and the 601 sequence. The successful 
formation and alignment of nucleosomes were initially confirmed by 
micrococcal nuclease digestion (Supplemental Fig. S13). 

First, we examined the chromatin structure assembled on the 
GAA42- and 601-containing circular recombinant DNA templates 
using electron microscopy (EM) and sucrose density gradient centrifu- 
gation. EM images showed that the chromatins assembled with two 
plasmids displayed slight compaction, and no obvious differences 
were found in chromatin structure (Supplemental Fig. $14). To further 
examine the extent of chromatin assembly, the reconstituted chromatin 
was analyzed by sucrose gradient analysis. Whereas naked DNA alone 
remained in fractions 6-7 (Fig. 5C (a)), chromatin migrated toward 
the center or bottom of the sucrose gradient (Fig. 5C (a), fractions 
9-12 of chromatin on pUCGAA42 and fractions 10-15 of chromatin on 
pUC601). A shift of approximately 2 fractions to the bottom was 
observed when the GAA42 triplet-repeat insert was replaced by the 
601 sequence. These data suggested that chromatin arrays on the GAA 
triplet-repeat-containing template adopted a looser and more open 
structure than that on the 601-containing template. 

The formation of one nucleosome on a closed circular DNA template 
could cause a change in the linking number of approximately — 1.0 
[29-31]. To eliminate the topological effects caused by DNA supercoils 
in the nucleosome assembly assay, we reconstituted chromatin on the 
linear plasmid templates (Fig. 5A). The reconstituted chromatins were 
analyzed by EM to detect any visible changes of the chromatin structure 


on the linear template with GAA42 triplet-repeats and 601 sequence 
inserts (Supplemental Fig. S14). Chromatin assembled in vitro displayed 
a “beads on a string” structure. Sucrose gradient analysis revealed that 
whereas linear DNA alone remained close to the top (Fig. 5C (b), fraction 
4), the chromatins assembled on pUCGAA42 and on pUC601 migrated 
toward fractions 7-11 and fractions 9-14 of the sucrose gradient, 
respectively. Similar to the shift in the case of the circular DNA tem- 
plates, a shift of approximately 2 fractions to the bottom was also 
found for bands of chromatin assembled on linear pUC601 compared 
with linear pUCGAA42. The results were similar to the aforementioned 
results of chromatin reconstitution assay on circular templates. 

For the sedimentation detection of reconstituted chromatin on 
plasmids, the lengths of plasmids likely influence the observed differ- 
ences between plasmids containing GAA42 triplet-repeats and the 
plasmid containing the 601 sequence. The recombinant plasmid lengths 
containing GAA42 triplet-repeats and the 601 sequence were 2816 and 
2873 base pairs, respectively. Comparing the recombinant plasmid 
length, the approximately 57-bp length differences may have slightly 
effect on the reconstituted chromatin structure in vitro. To further test 
the effects of repeat sequences on chromatin structure quantitatively, 
the analytical ultracentrifugation (AUC) in sedimentation velo- 
city experiments was analyzed for assembled chromatin with GAA- 
containing circular plasmids, using CAG-containing, and 601-containing 
plasmids as controls. The Saye for the chromatin on the GAA42- 
containing plasmid and on the 601-containing plasmid was 43 S and 
51 S, respectively, which agreed with the sucrose gradient data. The 
Save Values of the chromatin on the GAA (n) triplet-repeats (n = 7, 27, 
42) were slightly smaller than those on the CAG (n) triplet-repeats 
(n = 7, 27, 44) (Fig. 5D). This finding was consistent with the results 
derived from the previous analysis of the nucleosome occupancy on 
GAA and CAG triplet-repeats in the present work. The results also 
suggested that GAA triplet-repeat inserts destabilized the ability of 
recombinant plasmids to assemble nucleosomes. 


3. Discussion 


In the present study, the effects of GAA triplet-repeats on the local 
chromatin structure were investigated based on nucleosome position- 
ing mapping data of the human genome in vivo and on an in vitro 
chromatin reconstitution assay. The results showed that the GAA 
triplet-repeats could induce local nucleosome depletion in vivo, and 
the A-tract upstream of repeats could further enhance the depletion 
surrounding GAA triplet-repeats. Compared with 601 sequence and 
CAG triplet-repeat inserts, the GAA triplet-repeat inserts in recombi- 
nant plasmids decreased nucleosome formation ability in the in vitro 
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reconstitution assay, which indicates that the GAA triplet-repeats are 
weak nucleosome-positioning elements. 

The weak nucleosome assembly on the GAA triplet-repeats was like- 
ly due to the abundance of AA dinucleotides. Nucleosome positioning 
along the genome might be determined primarily by the preference of 
DNA sequences [32-38] and by external factors, such as chromatin re- 
modeling, DNA methylation, histone variants, post-translational modifi- 
cations, and polymerase II binding [28,38,39]. An in vivo genome-wide 
survey of nucleosome positions and reconstituted chromatin structures 
in vitro suggested that local nucleosome positioning around GAA triplet- 
repeats was mediated primarily by the primary sequence. The homo- 
polymeric stretches of deoxyadenosine nucleotides were identified as 
strong determinants of nucleosome organization, such as poly(dA:dT). 
The poly(dA:dT) tract disfavors nucleosome formation [37,40] and has 
been strongly associated with nucleosome depletion in vivo [34,41] 
and in vitro [34,39,42]. Besides, DNA flexibility could also affect nucleo- 
some assembly [43]. Given the flexibility of dinucleotides, the AA dinu- 
cleotide is relatively stiff. Of the 12 trinucleotides, the flexibility of GAA 
is slightly higher than that of CAA and AAA [44]. Our previous work also 
indicated that the A-rich region (especially the poly-A region) is rela- 
tively rigid [45]. The weak flexible GAA trinucleotides in the sequence 
may have caused greater free energy for the bending of the repeats, 
allowing them to wrap around histones. Therefore the lots of energetic 
cost decrease nucleosome assembly on the triplet-repeats and the 
flanking sequence, which might alter the local nucleosome positioning 
around GAA triplet-repeats. 

Our data indicated that nucleosome depletion is present on healthy- 
length GAA triplet-repeats in the genomes of unaffected individuals. 
The low-level nucleosome occupancy on GAA triplet-repeats is condu- 
cive to the transcription of the FXN gene. 

Nucleosome positioning is dynamically regulated during the cell 
cycle and in response to regulation signals in vivo. Nucleosome arrays 
with high nucleosome occupancy are more feasible for forming the het- 
erochromatin structure. However, nucleosome occupancy is not the 
only factor to affect the formation of the heterochromatin structure 
in vivo. At high densities of nucleosomes along the DNA, nucleosome 
will be positioned over lower-affinity sites, while nucleosomes will 
favor high-affinity sites at low densities [35,37]. In our work, the nucle- 
osome depletion pattern on GAA triplet-repeats was investigated in 
cells obtained from unaffected individuals. In the disease condition, 
the nucleosome occupancy and dynamic repositioning may be primari- 
ly affected by epigenetic signals or other external factors rather than by 
the preferences of DNA sequences. The GAA triplet-repeats with low af- 
finity to octamers may have higher nucleosome occupancy in response 
to some regulatory signals, although these responses have high energy 
demands to change their conformations. 

Taken together, our results suggested that nucleosome depletion on 
GAA triplet-repeats was present and that GAA triplet-repeats had a 
weak ability to form nucleosomes in unaffected individuals. These 
results could answer questions regarding the chromatin structure asso- 
ciated with shorter-than-average expanded GAA triplet-repeats in 
FRDA, which is one of the definitive questions for the understanding 
of the pathogenesis of FRDA at the chromatin level [21]. However, 
several questions remain unclear. For instance, how the GAA triplet- 
repeat signal heterochromatin formation remains a subject of debate 
[12], and some strong evidence is also needed to investigate whether 
and how the longer expansive GAA triplet-repeats in intron 1 of the 
FXN gene affect the changes in epigenetic hallmarks in cells isolated 
from FRDA patients. Further study by chromatin structure analysis of 
GAA triplet-repeat loci in the first intron of the FXN gene across cellular 


and animal models is required to better understand the mechanisms of 
the FXN gene silencing. 


4. Materials and methods 
4.1. Genomic DNA and nucleosome positioning data 


Whole human genome sequences (version: hg18) were down- 
loaded from the UCSC Genome Browser (http://genome.ucsc.edu/). 
The human nucleosome occupancy data in activated and resting 
human CDA" T cells were obtained from published experimental data 
(http://dir.nhlbi.nih.gov/papers/Imi/epigenomes/hgtcellnucleosomes. 
aspx) [28]. All of the uninterrupted GAA triplet-repeat tracts consisting 
of five or more repeated units were identified in the human genome. 
Repeats lacking nucleosome occupancy information were excluded. 
One thousand base pair upstream and downstream of the start site of 
the repeats were selected. The average values of the nucleosome occu- 
pancy on all of the selected DNA fragments containing repeats were 
computed statistically. Customized C programs were used to examine 
the distributions of triplet-repeats and nucleosome levels around 
triplet-repeats across the entire genome. 


4.2. Preparation of DNAs and recombinant histone octamer 


The recombinant plasmid pUC(GAA)42 containing GAA42 was con- 
structed as described in the Hashem-mutation document [46]. (GAA),, 
(CTT), EcoRI adaptors (EP adaptor), and BamHI adaptors (PB adaptor) 
were chemically synthesized (Sangon, China). (GAA)z, (CIT), and the 
two adaptor oligonucleotides that were phosphorylated using ATP and 
kinase (NEB), were precipitated, and were then resuspended in TEN 
(10 mM Tris-HCl, pH 7.6, 1 mM EDTA and 50 mM NaCl). Complementa- 
ry pairs of oligonucleotides were hybridized by boiling and slowly 
cooling to 4 °C overnight. Hybridized (GAA); / (CTT); were ligated 
overnight at 4 °C in 30 ul of ligation buffer with 20 units of T4 DNA ligase 
(NEB). The resulting [(GAA) - (CIT) ]42 polymer was then ligated at 4 °C 
to hybridized EP adaptor and PB adaptor. The fragments were subse- 
quently double digested with EcoRI and BamHI and were separated on 
a 5% polyacrylamide gel. The purified fragments were then ligated 
into the EcoRI and BamHI sites in the multiple cloning site of pUC19. 
pUC(GAA)7, pUC(CAG)7, pUC(GAA)27, pUC(CAG)27, and pUC(CAG)44 
were constructed using similar methods. The recombinant plasmid 
pUC601 was constructed by adding one copy of the 601 sequence to 
the BamHI and PstI sites of the pUC19 vector. The recombinant plasmids 
were prepared sufficiently. The closed plasmids were digested with Sspl 
(NEB) to prepare the linear plasmids. The chromatins were assembled 
on closed or linear DNA templates. 

The expression and purification of histones were performed ac- 
cording to methods published previously [47,48]. Each histone was 
expressed and purified from Escherichia coli BL21 cells containing 
pET-histone expression plasmids. The four core histones in unfolding 
buffer (7 M guanidinium HCl, 20 mM Tris-HCl, pH 7.5, 10 mM DTT) 
were mixed at equimolar ratios. The mixture was dialyzed against 
at least three changes of 1 L of refolding buffer (2 M NaCl, 10 mM 
Tris-HCl, pH 7.5, 1 mM Na-EDTA, and 5 mM 2-mercaptoethanol). 
The second and third dialysis steps were performed overnight. The 
histone octamer was maintained at 4 °C. Precipitated proteins were 
removed by centrifugation for 30 min at 20,000 x g. Samples were 
concentrated to a final volume of 0.5 mL before purification through 
a Superdex S200 filtration column (GE Healthcare). Confirmation of 


Fig. 5. Chromatin assembly assay of the recombinant plasmids containing GAA triplet-repeats, CAG triplet-repeats and Widom 601 sequence in vitro. A. Schematic diagram of the 
reconstituted plasmids. GAA or CAG triplet-repeats were inserted into the EcoRI and BamHI sites of pUC19, and the Widom 601 sequence was added into PstI and BamHI sites of the same 
vector. The closed plasmids were digested with Sep) and became linear DNA. B. SDS-PAGE analysis of bacterially expressed and purified recombinant octamer. C. Sucrose gradient fractions 
of the assembled chromatin analyzed by agarose gel electrophoresis and ethidium bromide staining. D. Sedimentation coefficient distribution plots for GAA7-, GAG7-, GAA27-, CAG27-, 


GAA42-, CAG44-, and 601-containing plasmid DNA and chromatin. 
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the purity and stoichiometry of the histone octamer was performed 
using SDS-PAGE on 15% gels with Coomassie Brilliant Blue staining 
and the concentration was determined using an extinction coeffi- 
cient at 276 nm. 


4.3. Nucleosome array assembly reaction in vitro 


For in vitro structure investigation, chromatin arrays were assem- 
bled using the salt-dialysis method as previously described [47,49]. 
The reconstitution reactions were conducted with histone octamer 
and pUC(GAA)n (n = 7, 27, 42), pUC(CAG)n (n = 7,27,44) and 
pUC601 DNA templates. All of the steps were performed at 4 °C. Eight 
micrograms of each DNA were incubated in 30 uL reactions containing 
10 mM Tris-HCl (pH 8.0), 1 mM EDTA (pH 8.0), 2 M NaCl, and 9.6 ug 
of histone octamer (added last). The samples were placed in a micro- 
dialysis apparatus with 6-8 kDa dialysis tubing. They were then placed 
in a beaker containing high-salt buffer (10 mM Tris-HCl, pH 8.0, 2 M 
NaCl, 1 mM EDTA), which was continuously diluted by slowly pumping 
in TE buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA) to a lower concentra- 
tion of NaCl from 2 M to 0.6 M over a period of 16 h. After this period, the 
low-salt buffer beaker was replaced with HE buffer (10 mM HEPES, 
pH 8.0, 0.1 mM EDTA) lacking any NaCl, and the nucleosomes were 
dialyzed for an additional minimum of 3 h with constant stirring. 


4.4. Analysis of nucleosome arrays 


For each reconstitution, chromatin was analyzed by EM imaging, 
sucrose density gradient centrifugation (10%-30%) or sedimentation 
velocity with AUC. 

For imaging with EM, reconstituted chromatin samples were pre- 
pared using DNA concentrations of 20 ug/mL in measurement buffer 
(10 mM HEPES, pH 8.0, 0.1 mM EDTA). The samples were fixed with 
0.4% glutaraldehyde (Fluka) in the same buffer for 30 min on ice. 
Then, 2 mM spermidine was added to the sample solution to enhance 
the absorption of the chromatins by the glow-charged carbon-coated 
grids. The samples were applied to the EM grids and were incubated 
for 2 min and then blotted. The grids were washed stepwise with 
increasing amounts of ethanol/water in 20 mL baths of 0%, 25%, 50%, 
75%, and 100% ethanol solution for 4 min, each at room temperature, 
and they were then air dried and shadowed with tungsten at an angle 
of rotation of 10°. The samples were examined using a FEI Tecnai G2 
Spirit 120 kV transmission electron microscope [49]. 

For sucrose gradients, chromatin samples were loaded on a sucrose 
gradient in HEN buffer (10 mM HEPES, 0.2 mM EDTA, 25 mM NaCl). 
The sucrose gradients were centrifuged for 16 h at 24,000 rpm in a 
Beckman 60 Ti rotor. Fractionation was performed manually (250 pL 
fractions), and samples were loaded with SDS on a 1% agarose gel, 
stained with ethidium bromide, and destained in water. 

For AUC sedimentation velocity experiments, chromatin samples 
containing 18 ug of DNA were prepared in measurement buffer 
(10 mM HEPES at pH 8.0, 0.1 mM EDTA). The sedimentation experi- 
ments were performed on a Beckman Coulter ProteomeLab XL-I using 
a four-hole An-60Ti rotor. Samples with an initial absorbance at 
260 nm of 0.5-0.8 were equilibrated for 2 h at 20 °C under a vacuum 
in a centrifuge prior to sedimentation. The absorbance at 260 nm 
was measured in continuous scan mode during sedimentation at 
30,000 x g for naked DNA and at 22,000 x g for chromatin DNA in 
12-mm double-sector cells. The data were analyzed using enhanced 
van Holde-Weischet analysis and Ultrascan II software, version 9.9 
revision 1504. The S2o,w values (sedimentation coefficient corrected 
for water at 20 °C) were calculated with a partial specific volume of 
0.622 mL/g for chromatin, and the buffer density and viscosity were 
adjusted. The average sedimentation Save coefficients were determined 
at the boundary midpoint [49]. 
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